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Human Telomerase Gene 
INTRODUCTION 

Field Qf thg Invention 

The field of this invention is a human gene encoding an enzyme involved in cell 
replication. 

Background 

DNA at chromosome ends is maintained in a dynamic balance of loss and addition 
of telomeric simple sequence repeats. Sequence loss occurs during cell replication, in part 
from incomplete replication of chromosome termini by DNA-dependent DNA 
polymerase. Telomeric repeat addition is catalyzed by the enzyme telomerase: a 
ribonucleoprotein enzyme which uses a short region within the RNA as a template for the 
polymerase reaction. Although cells can maintain a constant number of telomeric repeats 
by balancing repeat loss and addition, not all cells do so. Human germline and cancer 
cells maintain a constant number of telomeric repeats, while normal human somatic cells 
lose telomeric repeats with each cycle of cell division. Cells which do not maintain stable 
telomere length demonstrate a limited proliferative capacity: these cells senesce after a 
number of population doublings correlated with the erosion of telomeres to a critical 
minimum length. 

Because normal somatic cells do not appear to express or require telomerase and 
do not maintain chromosome ends, and because all or almost all cancer cells express high 
levels of telomerase activity and maintain chromosome ends, molecules that inhibit or 
alter telomerase activity could provide effective and non-toxic anti-cancer agents. 
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Similarly, inhibition of telomerase in parasitic or infectious agents (e.g. trypanosomes, 
fungi, etc.) could provide a specific approach for reducing the viability or proliferation of 
these agents. Conversely, activation of telomerase in proliferation-restricted cells (such as 
normal somatic cells of the blood, vasculature, liver, skin, etc.) could provide a 
mechanism for promoting additional proliferative lifespan. 

Relevant Literature 

Purification of telomerase from the ciliate Tetrahymena and cloning of genes 
encoding two protein components of the enzyme is reported in Collins et al. (1995) Cell 
81, 677-686 and copending US patent application No. 08/359,125, filed 19 DEC 1994. 
Literature relating to human telomerase include Kim et al. (1994) Science 266, 201 1- 
2014; and Feng et al (1995) Science 269, 1236-1241. Literature relating to telomerase 
template modifications include Autexier et al. (1994) Genes and Devel 8, 563-575; Yu et 
al. (1991) Cell 67, 823-832; and Yu et al. (1990) Nature 344, 126-132. The Washington 
University-Merck EST Project contains an EST, reportedly deposited by Hillier et al. on 
Nov 1, 1995, which has sequence similarity with the 3' end of SEQ ID NO:3, disclosed 
herein. For a general review, see Blackburn et al., Eds. (1995) Telomeres, Cold Spring 
Harbor Laboratory Press. 

SUMMARY OF THE INVENTION 

The invention provides methods and compositions relating to a human telomerase 
and related nucleic acids. Included are four distinct human telomerase subunit proteins, 
called pl40, pl05, p48 and p43 and telomerase protein domains thereof having 
telomerase-specific activity. The proteins may be produced recombinantly from 
transformed host cells from the subject telomerase encoding nucleic acids or purified from 
human cells. Also included are human telomerase RNA components, as well as specific, 
functional derivatives thereof. 

The invention provides isolated telomerase hybridization probes and primers 
capable of specifically hybridizing with the disclosed telomerase gene, telomerase- 
specific binding agents such as specific antibodies, and methods of making and using the 
subject compositions in diagnosis (e.g. genetic hybridization screens for telomerase 
transcripts), therapy (e.g. gene therapy to modulate telomerase gene expression) and in the 
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biopharmaceutical industry (e.g. reagents for screening chemical libraries for lead 
pharmacological agents and nucleic acid polymerase reagents). 

SEQ ID LISTING 
SEQ ID NO: 1 : pi 05 protein (amino acid sequence) 
5 SEQ ID NO:2: pl05 ambiguity maximized synthetic DNA 

SEQ ID NO:3: pl05 natural cDNA (the coding region is bp 97-2370) 
SEQ ID NO:4: pi 05 E. coli optimized synthetic DNA 
SEQ ID NO:5: pi 05 mammalian optimized synthetic DNA 
SEQ ID NO:6: telomerase RNA 
10 SEQ ID NO:7: telomerase RNA template region modification 1 

SEQ ID NO:8: telomerase RNA template region modification 2 
SEQ ID NO:9: telomerase RNA template region modification 3 
SEQ ID NO: 10 p43 peptide (XXXEAAT[I/L]D[I/L]PQQGANK, where the three X's are 
indeterminant residues) 

15 

DETAILED DESCRIPTION OF THE INVENTION 
The invention provides isolated human telomerase proteins including human 
telomerase proteins pi 40, pi 05, p48 and p43, having molecular weights of about 140kD, 
about 105kD, about 48kD and about 43kD, respectively, as determined by polyacrylamide 

20 gel electrophoresis under denaturing conditions (Matsudaira and Burgess (1978) Anal 

Biochem 87, 386-396), and telomerase protein domains thereof. The telomerase proteins 
comprise assay-discernable functional domains including RNA recognition motifs and 
subunit binding domains and may be provided as fusion products, e.g. with non- 
telomerase polypeptides. The human telomerase proteins of the invention, including the 

25 subject protein domains, all have telomerase-specific activity or function. 

Telomerase-specific activity or function may be determined by convenient in 
vitro, cell-based, or in vivo assays: e.g. in vitro binding assays, cell culture assays, in 
animals (e.g. immune response, gene therapy, transgenics, etc.), etc. Binding assays 
encompass any assay where the molecular interaction of a telomerase protein with a 

30 binding target is evaluated. The binding target may be a natural intracellular binding 

target such as a telomerase subunit (e.g. another protein subunit or RNA subunit), a 
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substrate, agonist, antagonist, chaperone, or other regulator that directly modulates 
telomerase activity or its localization; or non-natural binding target such a specific 
immune protein such as an antibody, or a telomerase specific agent such as those 
identified in assays described below. Generally, telomerase-binding specificity is assayed 
by telomere polymerase activity (see, e.g. Collins et al. 1995, Cell 81, 677-686), by 
binding equilibrium constants (usually at least about 10 7 M'\ preferably at least about 10 8 
M"\ more preferably at least about 10 9 M' 1 ), by the ability of the subject protein to 
function as negative mutants in telomerase-expressing cells, to elicit telomerase specific 
antibody in a heterologous host (e.g a rodent or rabbit), etc. In any event, the telomerase 
binding specificity of the subject telomerase proteins necessarily distinguishes ciliate 
telomerase, preferably distinguishes non-mammalian telomerases and more preferably 
distinguishes non-human telomerases. Exemplary telomerase proteins which are shown 
to have telomerase binding specificity include the telomerase RNA (e.g. SEQ ID NO:6) 
binding domains (e.g. RRM 1-4: SEQ ID NO:l, about residues 5-81, residues 115-192, 
residues 336-420, and residues 487-578, respectively), telomerase primer binding 
domains, nucleotide triphosphate binding domains and binding domains of regulators of 
telomerase such as nuclear localization proteins, etc. As used herein, a protein domain 
comprises at least 12, preferably at least about 20, more preferably at least about 40, most 
preferably at least about 80 residues of the disclosed respective SEQ ID NO. 

The claimed human telomerase proteins are isolated or pure: an "isolated" protein 
is unaccompanied by at least some of the material with which it is associated in its natural 
state, preferably constituting at least about 0.5%, and more preferably at least about 5% by 
weight of the total protein in a given sample and a pure protein constitutes at least about 
90%, and preferably at least about 99% by weight of the total protein in a given sample. 
The telomerase proteins and protein domains may be synthesized, produced by 
recombinant technology, or purified from human cells. A wide variety of molecular and 
biochemical methods are available for biochemical synthesis, molecular expression and 
purification of the subject compositions, see e.g. Molecular Cloning, A Laboratory 
Manual (Sambrook, et al Cold Spring Harbor Laboratory), Current Protocols in 
Molecular Biology (Eds. Ausubel, et al, Greene Publ. Assoc., Wiley-Interscience, NY) or 
that are otherwise known in the art. An exemplary method for isolating each of human 
telomerase protein p!40, pl05, p48 and p43 from human cells is as follows: 



WO 98/01543 PCT/US97/12297 

Several thousand (two to twelve thousand) liters of HeLa cells are grown in 
spinner culture. The cells are lysed by dounce homogenization in low-salt buffer to 
produce crude cell lysates. The lysates are supplemented with 15% glycerol and 
centrifuged at 125,000 x g for 50 minutes to obtain a first soluble fraction enriched for 
telomerase activity (S-100 fraction). The S-100 fraction is adjusted to 0.2 M ammonium 
5 sulfate, bound to SP Sepharose 

(Pharmacia), and developed with a gradient in sodium chloride, to obtain a second soluble 
fraction enriched for telomerase (SP fraction). The SP fraction is adjusted to about 0.3- 
0.4 M ionic strength and bound to Q Sepharose (Pharmacia), and developed with a 
gradient in sodium chloride, to obtain a third soluble fraction enriched for telomerase (Q 

10 fraction). The Q fraction is adjusted to about 0.3-0.4 M ionic strength, bound to 

phosphocellulose (Whatman), and developed with sodium chloride, to obtain a fourth 
soluble fraction enriched for telomerase (PC fraction). The PC fraction is adjusted to 
about 0.3-0.4 M ionic strength, bound to 2'Omethyl RNA oligonucleotide immobilized on 
streptavidin agarose (Sigma), and eluted with a electrophoresis sample medium 

1 5 comprising 5% P-mercaptoethanol and 2% Sodium Dodecyl Sulfate to obtain a fifth 

soluble fraction (2'Omethyl fraction). The 2 , Omethyl fraction is separated by 
polyacrylamide gel electrophoresis under denaturing conditions (Matsudaira and Burgess 
(1978) Anal Biochem 87, 386-396) to obtain gel protein bands at a molecular weight of 
about 140kD, 105kD, 48kD or 43kD having telomerase activity. The gel bands are 

20 excised or blotted to obtain purified human telomerase proteins pi 40, pi 05, p48 and p43. 

The subject telomerase proteins find a wide variety of uses including use in 
isolating, enriching for and concentrating telomerase RNA and telomerase proteins, as 
immunogens, in the methods and applications described below, as reagents in the 
biotechnology industries, and in therapy. Recombinant telomerase are used in many 

25 applications where nascent oligonucleotides of predetermined sequence are desired. For 

example, native nucleic acid molecules are labeled or extended at their 3' ends by addition 
of a predetermined repeat sequence (for double-stranded oligonucleotides, both ends of 
the molecule may be tagged). Oligonucleotides complementary to the repeat are then 
used to amplify, sequence, affinity purify, etc. the nucleic acid molecules. The use of a 

30 repeat sequence for 3 1 end tagging improves specificity and provides sequence alternatives 

compared with non-templated enzymes presently available for this purpose, e.g. terminal 
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transferase. Repeats encoding restriction enzyme sites provide repeat tagging to facilitate 
cloning and the use of telomerase alleviates the restrictive conditions required for optimal 
ligation with available ligase enzymes. Telomerase also finds use in regulating cell 
growth or increasing cell density tolerance; for example, cells contacted with an effective 
amount of exogenous telomerase to overcome the growth control limitation otherwise 
imposed by short telomere length- Telomerase may be introduced, expressed, or 
repressed in specific populations of cells by any convenient way such as microinjection, 
promoter-specific expression of recombinant enzyme, targeted delivery of lipid vesicles, 
etc. Advantageously, only a brief period of telomerase activity is required to allow many 
generations of continued proliferation of the contacted cell, due to the ability of 
telomerase to extend telomeres in one cell cycle by more sequence than is lost with each 
cell division. 

The invention provides natural and non-natural human telomerase-specific binding 
agents including substrates, agonist, antagonist, etc., methods of identifying and making 
such agents, and their use in diagnosis, therapy and pharmaceutical development. For 
example, human telomerase-specific agents are useful in a variety of diagnostic and 
therapeutic applications. Novel human telomerase-specific binding agents include human 
telomerase-specific receptors, such as somatically recombined protein receptors like 
specific antibodies or T-cell antigen receptors (see, e.g Harlow and Lane (1988) 
Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory) and other natural 
intracellular binding agents identified with assays such as one-, two- and three-hybrid 
screens, non-natural intracellular binding agents identified in screens of chemical libraries 
such as described below, etc. Agents of particular interest modulate human telomerase 
function, e.g. human telomerase antagonists and find use methods for modulating the 
binding of a human telomerase or telomerase protein to a human telomerase binding 
target. 

For diagostic uses, the binding agents are frequently labeled, such as with 
fluorescent, radioactive, chemiluminescent, or other easily detectable molecules, either 
conjugated directly to the binding agent or conjugated to a probe specific for the binding 
agent. Binding agents also find use in modulating the telomerase activity present in a cell. 
For example, isolated cells, whole tissues, or individuals may be treated with a telomerase 
binding agent to activate, inhibit, or alter the specificity of telomerase assembly, 
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localization, substrate interaction, or synthesis activity. Effectively treated cells have 
increased or decreased replication potential, or suffer from loss of proper telomere 
structure (resulting in lethality). These binding agents also find therapeutic use to control 
cell proliferation; for example, the uncontrolled growth of transformed cells (e.g. cancer 
cells) is managed by administration to the cells or patient comprising such cells of a 
5 telomerase binding agent which reduces telomerase activity. In contrast to many current 

chemotherapies, the present invention provides enhanced specificity of lethality, with 
minimum toxicity to dividing yet normal somatic cells. 

The amino acid sequences of the disclosed telomerase proteins are used to back- 
translate telomerase protein-encoding nucleic acids optimized for selected expression 

10 systems (Holler et al. (1993) Gene 136, 323-328; Martin et al. (1995) Gene 154, 150-166) 

or used to generate degenerate oligonucleotide primers and probes for use in the isolation 
of natural telomerase encoding nucleic acid sequences ("GCG" software, Genetics 
Computer Group, Inc, Madison WI). As examples, SEQ ID NO:2 discloses an ambiguity- 
maximized pi 05 coding sequence encompassing all possible nucleic acids encoding the 

15 full-length pi 05 protein. SEQ ID NO:3 discloses a natural human cDNA sequence 

encoding pi 05, SEQ ID NO:4 is a pi 05 coding sequence codon-optimized for E. coli, 
SEQ ID NO:5 is a pi 05 coding sequence codon optimized for mammalian cell expression. 
Telomerase encoding nucleic acids may be part of human telomerase-expression vectors 
and may be incorporated into recombinant host cells, e.g. for expression and screening, 

20 transgenic animals, e.g. for functional studies such as the efficacy of candidate drugs for 

disease associated with human telomerase-mediated signal transduction, etc. Expression 
systems are selected and/or tailored to effect human telomerase protein structural and 
functional variants through alternative post-translational processing. 

The invention also provides nucleic acid hybridization probes and 

25 replication/amplification primers having a human telomerase cDNA specific sequence 

contained in SEQ ID NO:3, bases 1-2345, and sufficient to effect specific hybridization 
thereto (i.e. specifically hybridize with SEQ ID NO:3, bases 1-2345 in the presence of 
natural ciliate telomerase cDNA, preferably in the presence of non-mammalian telomerase 
cDNA and more preferably, in the presence of murine telomerase cDNA). Demonstrating 

30 specific hybridization generally requires stringent conditions, for example, hybridizing in 

a buffer comprising 30% formamide in 5 x SSPE (0.18 M NaCl, 0.01 M NaPO<, pH7.7, 
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0.001 M EDTA) buffer at a temperature of 42°C and remaining bound when subject to 
washing at 42°C with 0.2 x SSPE; preferably hybridizing in a buffer comprising 50% 
formamide in 5 x SSPE buffer at a temperature of 42 °C and remaining bound when 
subject to washing at 42 °C with 0.2 x SSPE buffer at 42 °C. Human telomerase cDNA 
homologs can also be distinguished from other protein using alignment algorithms, such 
as BLASTX (Altschul et al (1990) Basic Local Alignment Search Tool, J Mol Biol 215, 
403-410). 

The invention also provides non-natural sequence and isolated natural sequence 
human telomerase RNA. Natural human telomerase RNA sequences include the nucleic 
acid disclosed as SEQ ID NO:6, or a fragment thereof sufficient to specifically hybridize 
with a nucleic acid having the sequence defined by SEQ ID NO:6. Such fragments 
necessarily distinguish the previously described (Feng et al. 1995, Science 269, 1236- 
1241) human RNA species. Preferred such fragments comprise SEQ ID NO:6, bases 191- 
210, bases 245-259, bases 341-369 or bases 381-399. Non-natural sequences include 
derivatives and/or mutations of SEQ ID NO:6, where such derivatives/mutations provide 
alteration in template, protein binding, or other regions to effect altered telomerase 
substrate specificity or altered reaction product (e.g. any predetermined sequence), etc.; 
see, e.g. Autexier et al., 1994, Genes & Develop 8, 563-575; Collins et al. (1995) EMBO 
J. 14, 5422-5432; Greider et al. (1995) Structure and Biochemistry of Ciliate and 
Mammalian Telomerases, in DNA Replication, DePamphlis, Ed., Cold Spring Harbor 
Laboratory Press. Additional derivatives function as dominant negative fragments which 
effectively compete for telomerase assembly. For examples, SEQ ID NO:7, 8 and 9 are 
derivatives which provide for modified substrate specificity and polymerase reaction 
product to interfere with cellular function (see, e.g. Hanish et al. (1994) Proc Natl Acad 
Sci 91, 8861-8865). 

The subject nucleic acids are of synthetic/non-natural sequences and/or are 
isolated, i.e. unaccompanied by at least some of the material with which it is associated in 
its natural state, preferably constituting at least about 0.5%, preferably at least about 5% 
by weight of total nucleic acid present in a given fraction, and usually recombinant, 
meaning they comprise a non-natural sequence or a natural sequence joined to 
nucleotide(s) other than that which it is joined to on a natural chromosome. Nucleic 
acids comprising the nucleotide sequence of SEQ ID NO:3 or fragments thereof, contain 

8 
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such sequence or fragment at a terminus, immediately flanked by a sequence other than 
that which it is joined to on a natural chromosome, or flanked by a native flanking region 
fewer than 10 kb, preferably fewer than 2 kb, which is immediately flanked by a sequence 
other than that which it is joined to on a natural chromosome. While the nucleic acids are 
usually RNA or DNA, it is often advantageous to use nucleic acids comprising other 
bases or nucleotide analogs to provide modified stability, etc. The subject nucleic acids 
find a wide variety of applications including use as translatable transcripts, hybridization 
probes, PCR primers, diagnostic nucleic acids, etc.; use in detecting the presence of 
human telomerase genes and gene transcripts and in detecting or amplifying nucleic acids 
encoding additional human telomerase homologs and structural analogs. 

In diagnosis, human telomerase hybridization probes find use in identifying wild- 
type and mutant human telomerase alleles in clinical and laboratory samples. Mutant 
alleles are used to generate allele-specific oligonucleotide (ASO) probes for high- 
throughput clinical diagnoses. In therapy, therapeutic human telomerase nucleic acids are 
used to modulate cellular expression or intracellular concentration or availability of active 
telomerase. A wide variety of indications may be treated, either prophylactically or 
therapeutically with the subject compositions. For example, where limitation of cell 
growth is desired, e.g. neoproliferative disease, a reduction in telomerase expression is 
effected by introducing into the targeted cell type human telomerase nucleic acids which 
reduce the functional expression of human telomerase gene products (e.g. nucleic acids 
capable of inhibiting translation of a functional telomerase transcript). Conditions for 
treatment include various cancers, where any of a wide variety of cell types may be 
involved, restenosis, where vascular smooth muscle cells are involved, inflammatory 
disease states, where endothelial cells, inflammatory cells and glomerular cells are 
involved, myocardial infarction, where heart muscle cells are involved, glomerular 
nephritis, where kidney cells are involved, transplant rejection where endothelial cells are 
involved, infectious diseases such as HTV infection where certain immune cells and other 
infected cells are involved, or the like. 

Telomerase inhibitory nucleic acids are typically antisense: single-stranded 
sequences comprising complements of the disclosed natural telomerase coding sequences. 
Antisense modulation of the expression of a given telomerase protein may employ 
telomerase antisense nucleic acids operably linked to gene regulatory sequences. Cell are 
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transfected with a vector comprising a human telomerase sequence with a promoter 
sequence oriented such that transcription of the gene yields an antisense transcript capable 
of binding to endogenous human telomerase protein encoding mRNA. Transcription of 
the antisense nucleic acid may be constitutive or inducible and the vector may provide for 
stable extrachromosomal maintenance or integration. Alternatively, single-stranded 
antisense nucleic acids that bind to genomic DNA or mRNA encoding a given human 
telomerase protein may be administered to the target cell, in or temporarily isolated from 
a host, at a concentration that results in a substantial reduction in expression of the 
targeted protein. 

In other indications, e.g. certain hypersensitivities, atrophic diseases, etc., an 
increase in cell growth or proliferation is desired. In these applications, an enhancement 
in human telomerase expression is effected by introducing into the targeted cell type 
human telomerase nucleic acids which increase the functional expression of human 
telomerase gene products. Conditions for treatment include multiple sclerosis, where 
certain neuronal cells are involved, inflammatory disease states such as rheumatoid 
arthritis, where bystander cells are involved, transplant rejection where graft cells are 
involved, infectious diseases such as HIV infection where certain uninfected host cells are 
involved, or the like. Such nucleic acids may be human telomerase expression vectors, 
vectors which upregulate the functional expression of an endogenous human telomerase 
allele, or replacement vectors for targeted correction of human telomerase mutant alleles. 

Various techniques may be employed for introducing of the nucleic acids into 
viable cells, e.g. transfection with a retrovirus, viral coat protein-liposome mediated 
transfection. The techniques vary depending upon whether one is using the subject 
compositions in culture or in vivo in a host. In some situations it is desirable to provide 
the nucleic acid source with an agent which targets the target cells, such as an antibody 
specific for a surface membrane protein on the target cell, a ligand for a receptor on the 
target cell, etc. Where liposomes are employed, proteins which bind to a surface 
membrane protein associated with endocytosis may be used for targeting and/or to 
facilitate uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, 
antibodies for proteins which undergo internalization in cycling, proteins that target 
intracellular localization and enhance intracellular half-life. 

The invention provides methods and compositions for enhancing the yield of 

10 
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many recombinantly produced proteins by increasing maximum cell densities and survival 
time of host production cells in culture. Specifically, cultured cells are transfected with 
nucleic acids which effect the up-regulation of endogenous telomerase or the expression 
of an exogenous telomerase. For example, nucleic acids encoding functional human 
telomerase operably linked to a transcriptional promoter are used to over-express the 
exogenous telomerase in the host cell. Telomerase-expressing cells demonstrate enhanced 
survival ability at elevated cell densities and over extended culture periods. 

The invention provides efficient methods of identifying agents, compounds or lead 
compounds for agents active at the level of a human telomerase modulatable cellular 
function. Generally, these screening methods involve assaying for compounds which 
modulate human telomerase interaction with a natural human telomerase binding target. 
A wide variety of assays for binding agents are provided including labeled in vitro 
telomere polymerase assays, protein-protein binding assays, immunoassays, cell based 
assays, etc. The methods are amenable to automated, cost-effective high throughput 
screening of chemical libraries for lead compounds. Identified reagents find use in the 
pharmaceutical industries for animal and human trials; for example, the reagents may be 
derivatized and rescreened in in vitro and in vivo assays to optimize activity and minimize 
toxicity for pharmaceutical development. Target indications may include infection, 
genetic disease, cell growth and regulatory disfunction, such as neoplasia, inflammation, 
hypersensitivity, etc. Target cells also include progenitor cells for repopulating blood or 
bone marrow, tissue grafts, and tissue subject to degredation/high turnover such as 
digestive and vascular endothelia and pulmunary and dermal epithelia. 

In vitro binding assays employ a mixture of components including a human 
telomerase protein, which may be part of multi-subunit telomerase, a fusion product with 
another peptide or polypeptide, e.g. a tag for detection or anchoring, etc. The assay 
mixtures comprise a natural intracellular human telomerase binding target, e.g. a 
substrate. While native binding targets may be used, it is frequently preferred to use 
portions (e.g. peptides, nucleic acid fragments) thereof so long as the portion provides 
binding affinity and avidity to the subject human telomerase conveniently measurable in 
the assay. The assay mixture also comprises a candidate pharmacological agent. 
Candidate agents encompass numerous chemical classes, though typically they are 
organic compounds; preferably small organic compounds and are obtained from a wide 

11 
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variety of sources including libraries of synthetic or natural compounds. A variety of 
other reagents may also be included in the mixture. These include reagents like salts, 
buffers, neutral proteins, e.g. albumin, detergents, protease inhibitors, nuclease inhibitors, 
antimicrobial agents, etc. may be used. 

The resultant mixture is incubated under conditions whereby, but for the presence 
of the candidate pharmacological agent, the human telomerase specifically binds the 
cellular binding target, portion or analog with a reference binding affinity. The mixture 
components can be added in any order that provides for the requisite bindings and 
incubations may be performed at any temperature which facilitates optimal binding. 
Incubation periods are likewise selected for optimal binding but also minimized to 
facilitate rapid, high-throughput screening. 

After incubation, the agent-biased binding between the human telomerase and one 
or more binding targets is detected by any convenient way. For cell-free binding type 
assays, a separation step is often used to separate bound from unbound components. 
Separation may be effected by precipitation (e.g. TCA precipitation, immunoprecipitation, 
etc.), immobilization (e.g on a solid substrate), etc., followed by washing by, for 
examples, membrane filtration (e.g. Whatman's P-81 ion exchange paper, Polyfiltronic's 
hydrophobic GFC membrane, etc.), gel chromatography (e.g. gel filtration, affinity, etc.). 
For telomere polymerase assays, binding is detected by a change in the polymerization by 
the telomerase of a nucleic acid or nucleic acid analog on the substrate. 

Detection may be effected in any convenient way. For cell-free binding assays, 
one of the components usually comprises or is coupled to a label. The label may provide 
for direct detection as radioactivity, luminescence, optical or electron density, etc. or 
indirect detection such as an epitope tag, an enzyme, etc. A variety of methods may be 
used to detect the label depending on the nature of the label and other assay components, 
e.g. through optical or electron density, radiative emissions, nonradiative energy transfers, 
etc. or indirectly detected with antibody conjugates, etc. 

A difference in the binding affinity of the human telomerase protein to the target 
in the absence of the agent as compared with the binding affinity in the presence of the 
agent indicates that the agent modulates the binding of the human telomerase protein to 
the human telomerase binding target. Analogously, in the cell-based transcription assay 
also described below, a difference in the human telomerase transcriptional induction in the 
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presence and absence of an agent indicates the agent modulates human telomerase- 
induced transcription. A difference, as used herein, is statistically significant and 
preferably represents at least a 50%, more preferably at least a 90% difference. 

The following examples are offered by way of illustration and not by way of 
limitation. 

EXAMPLES 

1 . Protocol for high-throughput human telomere polymerization assay. 

A. Reagents: 

- Neutralite Avidin : 20 ng/ml in PBS. 

- friman telomerase : 10' 8 - 10' 5 M human telomerase in PBS. 

- Blocking buffer : 5% BSA, 0.5% Tween 20 in PBS; 1 hour at room temperature. 

- Assay Buffer : 100 mM KC1, 20 mM HEPES pH 7.6, 1 mM MgCl 2 , 1 mM dATP, 
1 mM dTTP, 1% glycerol, 0.5% NP-40, 50 mM BME, 1 mg/ml BSA, cocktail of protease 
inhibitors. 

-[ 32 P]q-dGTP 1 Ox stock: 2 x 10 5 M "cold" dGTP with 100 ^Ci [ 32 P]a-dGTP. 
Place in the 4°C microfridge during screening. 

- t g)nmerase substrate : 10" 7 - lO^M biotinylated telomerase substrate (5-biotin- 

d(TTAGGG) 3 -3 , l in PBS. 

. Protease inhibitor cocktail f 1000XV 10 mg Trypsin Inhibitor (BMB # 109894), 
10 mg Aprotinin (BMB # 236624), 25 mg Benzamidine (Sigma # B-6506), 25 mg 
Leupeptin (BMB # 1017128), 10 mg APMSF (BMB # 917575), and 2mM NaVo 3 (Sigma 
#S-6508)in 10 ml of PBS. 

B . Preparation of assay plates: 

- Coat with 120 \il of stock N Avidin per well overnight at 4°C. 

- Wash 2 times with 200 ul PBS. 

- Block with 150 \i\ of blocking buffer. 

- Wash 2 times with 200 \i\ PBS. 

C. Assay: 

- Add 40 ^1 assay buffer/well. 

- Add 40 jil human telomerase (1-1000 fmoles/40 ul in assay buffer) 

- Add 10 ul compound or extract. 
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- Add 10 fil [ 32 P]a-dGTP lOx stock. 

- Add 40 \il biotinylated telomerase substrate (0.1-10 pmoles/40 ul in assay buffer) 

- Shake at 25 °C for 15 minutes. 

- Incubate additional 45 minutes at 25 °C. 

- Stop the reaction by washing 4 times with 200 pi PBS. 

- Add 150 scintillation cocktail. 

- Count in Topcount. 

D. Controls for all assays (located on each plate): 

a. Non-specific binding 

b. cold dGTP at 80% inhibition. 

2. Protocol for high throughput human telomerase subunit- RNA complex formation 
assay. 

A. Reagents: 

- Neutralite Avidin: 20 ng/ml in PBS. 

- Blocking buffer: 5% BSA, 0.5% Tween 20 in PBS; 1 hour at room temperature. 

- ASSAY Buffer 100 mM KC1, 20 mM HEPES pH 7.6, 1 mM MgCl 2 , 1% glycerol, 
0.5% NP-40, 50 mM P-mercaptoethanol, 1 mg/ml BSA, cocktail of protease inhibitors. 

■ human telomerase orotein lOx stock: 10 8 - 10 6 M "cold" human telomerase 
subunit (pi 05) supplemented with 200,000-250,000 cpm of labeled human telomerase 
(Beckman counter). Place in the 4°C microfridge during screening. 

- Protease inhibitor cocktail n OOpX)' 10 mg Trypsin Inhibitor (BMB # 109894), 
10 mg Aprotinin (BMB # 236624), 25 mg Benzamidine (Sigma # B-6506), 25 mg 
Leupeptin (BMB # 1017128), 10 mg APMSF (BMB #917575), and 2mMNaVo 3 (Sigma 
#S-6508) in 10 ml of PBS. 

- tglomgrasgRNA: 10 7 - lO^M biotinylated RNA (SEQ ID NO:6) in PBS. 

B . Preparation of assay plates: 

- Coat with 120 ^1 of stock N-Avidin per well overnight at 4°C. 

- Wash 2 times with 200 ul PBS. 

- Block with 150 *il of blocking buffer. 

- Wash 2 times with 200 \il PBS. 

C. Assay: 

14 



WO 98/01543 



PCIYUS97/12297 



- Add 40 \il assay buffer/well 

- Add 10 }il compound or extract. 

- Add 10 nl "P-human telomerase protein (20,000-25,000 cpm/0.1-10 pmoles/weil 
=10 9 - 10' 7 M final concentration). 

- Shake at 25 °C for 15 minutes. 

- Incubate additional 45 minutes at 25 °C. 

- Add 40 nl biotinylated RNA (0.1-10 pmoles/40 ul in assay buffer) 

- Incubate 1 hour at room temperature. 

- Stop the reaction by washing 4 times with 200 \il PBS. 

- Add 150 jil scintillation cocktail. 

- Count in Topcount. 

D. Controls for all assays (located on each plate): 

a. Non-specific binding 

b. Soluble (non-biotinylated telomerase) at 80% inhibition. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. Although the 
foregoing invention has been described in some detail by way of illustration and example 
for purposes of clarity of understanding, it will be readily apparent to those of ordinary 
skill in the art in light of the teachings of this invention that certain changes and 
modifications may be made thereto without departing from the spirit or scope of the 
appended claims. 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE : peptide 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Ala Gly Leu Thr Leu Phe Val Gly Arg Leu Pro Pro Ser Ala Arg 
15 10 15 

Ser Glu Gin Leu Glu Glu Leu Phe Ser Gin Val Gly Pro Val Lys Gin 
20 25 30 

15 Cys Phe Val Val Thr Glu Lys Gly Ser Lys Ala Cys Arg Gly Phe Gly 

35 40 45 

Tyr Val Thr Phe Ser Met Leu Glu Asp Val Gin Arg Ala Leu Lys Glu 

50 55 60 

He Thr Thr Phe Glu Gly Cys Lys He Asn Val Thr Val Ala Lys Lys 
20 65 70 75 80 

Lys Leu Arg Asn Lys Thr Lys Glu Lys Gly Lys Asn Glu Asn Ser Glu 

85 90 95 

Cys Pro Lys Lys Glu Pro Lys Ala Lys Lys Ala Lys Val Ala Asp Lys 
100 105 110 

25 Lys Ala Arg Leu He He Arg Asn Leu Ser Phe Lys Cys Ser Glu Asp 

115 120 125 

Asp Leu Lys Thr Val Phe Ala Gin Phe Gly Ala Val Leu Glu Val Asn 

130 135 140 

He Pro Arg Lys Pro Asp Gly Lys Met Arg Gly Phe Gly Phe Val Gin 
30 145 150 155 160 

Phe Lys Asn Leu Leu Glu Ala Gly Lys Ala Leu Lys Gly Met Asn Met 

165 170 175 

Lys Glu He Lys Gly Arg Thr Val Ala Val Asp Trp Ala Val Ala Lys 
180 185 190 

35 Asp Lys Tyr Lys Asp Thr Gin Ser Val Ser Ala He Gly Glu Glu Lys 

195 200 205 

Ser His Glu Ser Lys His Gin Glu Ser Val Lys Lys Lys Gly Arg Glu 
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210 215 220 

Glu Glu Asp Met Glu Glu Glu Glu Asn Asp Asp Asp Asp Asp Asp Asp 
22 5 230 235 240 

Asp Glu Glu Asp Gly Val Phe Asp Asp Glu Asp Glu Glu Glu Glu Asn 

245 250 255 

He Glu Ser Lys Val Thr Lys Pro Val Gin He Gin Lys Arg Ala Val 

260 265 270 

Lys Arg Pro Ala Pro Ala Lys Ser Ser Asp His Ser Glu Glu Asp Ser 

275 280 285 

Asp Leu Glu Glu Ser Asp Ser He Asp Asp Gly Glu Glu Leu Ala Gin 

290 295 300 

Ser Asp Thr Ser Thr Glu Glu Gin Glu Asp Lys Ala Val Gin Val Ser 
305 310 315 320 

Asn Lys Lys Lys Arg Lys Leu Pro Ser Asp Val Asn Glu Gly Lys Thr 

325 330 335 

Val Phe He Arg Asn Leu Ser Phe Asp Ser Glu Glu Glu Glu Leu Gly 

340 345 350 

Glu Leu Leu Gin Gin Phe Gly Glu Leu Lys Tyr Val Arg He Val Leu 

355 360 365 

His Pro Asp Thr Glu His Ser Lys Gly Cys Ala Phe Ala Gin Phe Met 

370 375 380 

Thr Gin Glu Ala Ala Gin Lys Cys Leu Leu Ala Ala Ser Pro Glu Asn 
385 390 395 400 

Glu Ala Gly Gly Leu Lys Leu Asp Gly Arg Gin Leu Lys Val Asp Leu 

405 410 415 

Ala Val Thr Arg Asp Glu Ala Ala Lys Leu Gin Thr Thr Lys Val Lys 

420 425 430 

Lys Pro Thr Gly Thr Arg Asn Leu Tyr Leu Ala Arg Glu Gly Leu He 

435 440 445 

Arg Ala Gly Thr Lys Ala Ala Glu Gly Val Ser Ala Ala Asp Met Ala 

450 455 460 

Lys Arg Glu Arg Phe Glu Leu Leu Lys His Gin Lys Leu Lys Asp Gin 
465 470 475 480 

Asn He Phe Val Ser Arg Thr Arg Leu Cys Leu His Asn Leu Pro Lys 

485 490 495 

Ala Val Asp Asp Lys Gin Leu Arg Lys Leu Leu Leu Ser Ala Thr Ser 

500 505 sio 

Gly Glu Lys Gly Val Arg He Lys Glu Cys Arg Val Met Arg Asp Leu 
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515 520 525 

Lys Gly Val His Gly Asn Met Lys Gly Gin Ser Leu Gly Tyr Ala Phe 

530 535 540 

Ala Glu Phe Gin Glu His Glu His Ala Leu Lys Ala Leu Arg Leu lie 
545 550 555 560 

5 Asn Asn Asn Pro Glu He Phe Gly Pro Leu Lys Arg Pro He Val Glu 

565 570 575 

Phe Ser Leu Glu Asp Arg Arg Lys Leu Lys Met Lys Glu Leu Arg lie 

580 585 590 

Gin Arg Ser Leu Gin Lys Met Arg Ser Lys Pro Ala Thr Gly Glu Pro 
10 595 600 605 

Gin Lys Gly Gin Pro Glu Pro Ala Lys Asp Gin Gin Gin Lys Ala Ala 

610 615 620 

Gin His His Thr Glu Glu Gin Ser Lys Val Pro Pro Glu Gin Lys Arg 
625 630 635 640 

15 Lys Ala Gly Ser Thr Ser Trp Thr Gly Phe Gin Thr Lys Ala Glu Val 

645 650 655 

Glu Gin Val Glu Leu Pro Asp Gly Lys Lys Arg Arg Lys Val Leu Ala 

660 665 670 

Leu Pro Ser His Arg Gly Pro Lys He Arg Leu Arg Asp Lys Gly Lys 
20 675 680 685 

Val Lys Pro Val His Pro Lys Lys Pro Lys Pro Gin He Asn Gin Trp 

690 695 700 

Lys Gin Glu Lys Gin Gin Leu Ser Ser Glu Gin Val Ser Arg Lys Lys 
705 710 715 720 

25 Ala Lys Gly Asn Lys Thr Glu Thr Arg Phe Asn Gin Leu Val Glu Gin 

725 730 735 

Tyr Lys Gin Lys Leu Leu Gly Pro Ser Lys Gly Ala Pro Leu Ala Lys 

740 745 750 

Arg Ser Lys Trp Phe Asp Ser 
30 755 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 





XViH\~2iX IN 1 1 


YGTNGGNMGN YTNCCNCCNW SNGCNMGNWS NGARCARYTN 


60 


n P H 21 0 V TMT 




NGGNCCNGTN AARCARTGYT TYGTNGTNAC NGARAARGGN 


120 




G X NGNGGN 1 1 


YGGNTAYGTN ACNTTYWSNA TGYTNGARGA YGTNCARMGN 


ieo 




AKATHACNAC 


NTTYGARGGN TGYAARATHA AYGTNACNGT NGCNAARAAR 


240 


AAKYTNMGNA 


AYAARACNAA 


RGARAARGGN AARAAYGARA AYWSNGARTG YCCNAARAAR 


300 


GARCCNAARG 


CNAARAARGC 


NAARGTNGCN GAYAARAARG CNMGNYTNAT HATHMGNAAY 


360 


YTNWSNTTYA 


ARTGYWSNGA 


RGAYGAYYTN AARACNGTNT TYGCNCARTT YGGNGCNGTN 


420 


YTNGARGTNA 


AYATHCCNMG 


NAARCCNGAY GGNAARATGM GNGGNTTYGG NTTYGTNCAR 


480 


TTYAARAAYY 


TNYTNGARGC 


NGGNAARGCN YTNAARGGNA TGAAYATGAA RGARATHAAR 


540 


GGNMGNACNG 


TNGCNGTNGA 


YTGGGCNGTN GCNAARGAYA ARTAYAARGA YACNCARWSN 


600 


GTNWSNGCNA 


THGGNGARGA 


RAARWSNCAY GARWSNAARC AYCARGARWS NGTNAARAAR 


660 


AARGGNMGNG 


ARGARGARGA 


YATGGARGAR GARGARAAYG AYGAYGAYGA YGAYGAYGAY 


720 


GAYGARGARG 


AYGGNGTNTT 


YGAYGAYGAR GAYGARGARG ARGARAAYAT HGARWSNAAR 


780 


G 1 IN ALNAAKL 


CNGTNCARAT 


HCARAARMGN GCNGTNAARM GNCCNGCNCC NGCNAARWSN 


840 


WaWtjAiCAlW 


SNGARGARGA 


YWSNGAYYTN GARGARWSNG AYWSNATHGA YGAYGGNGAR 


900 


uARi rNGCNC 


ARW 5 NGAYAC 


NWSNACNGAR GARCARGARG AYAARGCNGT NCARGTNWSN 


960 


AAYAARAARA 


ARMGNAARYT 


NCCNWSNGAY GTNAAYGARG GNAARACNGT NTTYATHMGN 


1020 


AAYYTNWSNT 


TYGAYWSNGA 


RGARGARGAR YTNGGNGARY TNYTNCARCA RTTYGGNGAR 


1080 


YTNAARTAYG 


TOMGNATHGT 


NYTNCAYCCN GAYACNGARC AYWSNAARGG NTGYGCNTTY 


1140 


GCNCARTTYA 


TGACNCARGA 


RGCNGCNCAR AARTGYYTNY TNGCNGCNWS NCCNGARAAY 


1200 


GAKG cng qng 


GNYTNAARYT 


NGAYGGNMGN CARYTNAARG TNGAYYTNGC NGTNACNMGN 


1260 


GA ygarg cng 


CNAARYTNCA 


RACNACNAAR GTNAARAARC CNACNGGNAC NMGNAAYYTN 


1320 


TAYYTNGCNM 


GNGARGGNYT 


NATHMGNGCN GGNACNAARG CNGCNGARGG NGTNWSNGCN 


13B0 


gcngayatgg 


CNAARMGNGA 


RMGNTTYGAR YTNYTNAARC AYCARAARYT NAARGAYCAR 


1440 


aayathttyg 


TNWSNMGNAC 


NMGNYTNTGY YTNCAYAAYY TNCCNAARGC NGTNGAYGAY 


1500 


aarcarytnm 


GNAARYTNYT 


NYTNWSNGCN ACNWSNGGNG ARAARGGNGT NMGNATHAAR 


1560 


gartgymgng 


TNATGMGNGA 


YYTNAARGGN GTNCAYGGNA AYATGAARGG NCARWSNYTN 


1620 


ggntaygcnt 


TYGCNGARTT 


YCARGARCAY GARCAYGCNY TNAARGCNYT NMGNYTNATH 


1680 


AAYAAYAAYC 


CNGARATHTT 


YGGNCCNYTN AARMGNCCNA THGTNGARTT YWSNYTNGAR 


1740 


gaymgnmgna 


ARYTNAARAT 


GAARGARYTN MGNATHCARM GNWSNYTNCA RAARATGMGN 


1800 


WSNAARCCNG 


CNACNGGNGA 


RCCNCARAAR GGNCARCCNG ARCCNGCNAA RGAYCARCAR 


1860 


CARAARGCNG 


CNCARCAYCA 


YACNGARGAR CARWSNAARG TNCCNCCNGA RCARAARMGN 


1920 


AARGCNGGNW 


SNACNWSNTG 


GACNGGNTTY CARACNAARG CNGARGTNGA RCARGTNGAR 


1980 
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YTNCCNGAYG 


GNAARAARMG 


NMGNAARGTN 


YTNGCNYTNC 


CNWS NCA Y MG 




& U 


ATHMGNYTNM 


GNGAYAARGG 


NAARGTNAAR 


CCNGTNCAYC 


CNAARAARCC 


NAARCCNCAR 


2100 


ATHAAYCART 


GGAARCARGA 


RAARCARCAR 


YTNWSNWSNG 


ARCARGTNWS 


NMGNAARAAR 


2160 


GCNAARGGNA 


AYAARACNGA 


RACNMGNTTY 


AAYCARYTNG 


TNGAR CARTA 


YAARCARAAR 


2220 


YTNYTNGGNC 


CNWSNAARGG 


NGCNCCNYTN 


GCNAARMGNW 


SNAARTGGTT 


YGAYWSN 


2277 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



TGAGCTTGGT 


TGTCCTACCA 


AAGCCAGCGT 


TTCGGCTCGC 


GTGCGCCGGC 


CTAGTTTGCT 


60 


CGCGTCCTCA 


CGCGCTTTGG 


GTTTCCCGGT 


CTCATGGCCG 


GCCTGACCTT 


ATTTGTGGGC 


12 0 


CGCCTCCCGC 


CCTCGGCCCG 


CAGTGAGCAG 


CTGGAGGAAC 


TGTTCAGTCA 


GGTGGGGCCG 


180 


GTGAAGCAGT 


GCTTCGTGGT 


GACTGAAAAA 


GGGAGTAAGG 


CATGTCGAGG 


CTTTGGCTAT 


240 


GTCACTTTTT 


CAATGCTGGA 


AGATGTTCAG 


AGGGCCCTCA 


AGGAGATTAC 


CACCTTTGAA 


300 


GGTTGCAAGA 


TCAACGTGAC 


TGTTGCCAAG 


AAAAAACTGA 


GGAACAAGAC 


AAAGGAAAAG 


360 


GGGAAAAATG 


AAAACTCAGA 


GTGCCCAAAG 


AAGGAGCCGA 


AGGCTAAAAA 


AGCCAAAGTG 


420 


GCAGATAAGA 


AAGCCAGATT 


AATTATTCGG 


AACCTGAGCT 


TTAAGTGTTC 


AGAAGATGAC 


480 


TTGAAGACAG 


TATTTGCTCA 


ATTTGGAGCT 


GTCCTGGAAG 


TAAATATCCC 


TAGGAAACCA 


540 


GATGGGAAGA 


TGCGCGGTTT 


TGGTTTTGTT 


CAGTTCAAAA 


ACCTCCTAGA 


AGCAGGTAAA 


600 


GCTCTCAAAG 


GCATGAACAT 


GAAAGAGATA 


AAAGGCCGGA 


CAGTGGCTGT 


GGATTGGGCC 


660 


GTGGCAAAGG 


ATAAATATAA 


AGATACACAG 


TCTGTTTCTG 


CTATAGGTGA 


GGAAAAGAGC 


720 


CATGAATCTA 


AACATCAGGA 


ATCAGTTAAA 


AAGAAGGGCA 


GAGAGGAAGA 


GGATATGGAA 


780 


GAGGAAGAAA 


ACGATGATGA 


TGACGATGAT 


GATGATGAAG 


AAGATGGGGT 


TTTTGATGAT 


840 


GAAGATGAAG 


AGGAAGAGAA 


TATAGAATCA 


AAGGTGACCA 


AGCCTGTGCA 


AATTCAGAAG 


900 


AGAGCAGTCA 


AGAGACCAGC 


CCCTGCAAAA 


AGCAGTGATC 


ATTCTGAGGA 


GGACAGTGAC 


960 


CTAGAGGAAA 


GCGATAGTAT 


TGATGATGGA 


GAGGAACTGG 


CTCAGAGTGA 


TACCAGCACT 


1020 


GAGGAGCAAG 


AGGATAAAGC 


TGTGCAAGTC 


TCAAACAAAA 


AGAAGAGGAA 


ATTACCCTCT 


1080 


GATGTGAATG 


AAGGGAAAAC 


TGTTTTTATC 


AGAAATCTGT 


CCTTTGACTC 


AGAAGAAGAA 


1140 


GAACTTGGGG 


AGCTTCTCCA 


ACAGTTTGGA 


GAACTCAAAT 


ATGTCCGCAT 


TGTCTTGCAT 


1200 


CCAGACACAG 


AGCATTCTAA 


AGGTTGTGCA 


TTTGCCCAGT 


TCATGACTCA 


AGAAGCAGCT 


1260 
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CAGAAATGCC TTCTAGCTGC TTCTCCAGAG AATGAGGCTG GTGGGCTTAA ACTGGATGGC 1320 

CGGCAGCTCA AGGTTGACTT GGCGGTGACC CGTGATGAGG CTGCAAAGCT TCAGACGACG 1380 

AAGGTGAAGA AGCCGACTGG CACCCGGAAT CTCTATCTGG CCCGAGAAGG CTTGATTCGT 144 0 

GCTGGGACGA AGGCTGCAGA GGGTGTGAGT GCTGCTGATA TGGCCAAAAG AGAACGGTTT 150 0 

GAGCTGCTGA AGCATCAGAA ACTCAAGGAC CAGAATATCT TTGTCTCCCG AACCAGGCTC 1560 

TGCCTGCACA ATCTCCCAAA GGCTGTAGAT GACAAACAGC TCAGAAAGCT GCTGCTGAGT 162 0 

GCTACTAGTG GAGAGAAAGG GGTGCGCATC AAGGAGTGTA GAGTGATGCG AGACCTCAAA 1680 

GGAGTTCATG GGAACATGAA GGGTCAGTCC CTGGGCTACG CCTTTGCGGA GTTCCAAGAG 1740 

CACGAGCATG CCCTGAAAGC CCTCCGCCTC ATCAACAACA ATCCAGAAAT CTTTGGGCCT 1800 

CTGAAGAGAC CAATAGTGGA GTTCTCTTTA GAAGATCGAA GAAAACTTAA AATGAAGGAA I860 

TTAAGGATCC AGCGCAGCTT GCAAAAAATG AGATCCAAGC CTGCAACTGG TGAGCCTCAG 1920 

AAGGGGCAAC CAGAGCCTGC AAAAGACCAG CAACAGAAGG CAGCTCAACA CCACACAGAG 1980 

GAACAAAGCA AGGTGCCCCC AGAGCAGAAG AGAAAGGCGG GCTCTACCTC ATGGACCGGG 2040 

TTCCAGACCA AGGCTGAAGT GGAGCAGGTG GAGCTGCCTG ATGGAAAGAA GAGAAGAAAG 2100 

GTCCTGGCGC TCCCCTCACA CCGAGGCCCC AAAATCAGGT TGCGGGACAA AGGCAAAGTG 2160 

AAGCCCGTCC ATCCCAAAAA GCCAAAGCCA CAGATAAACC AGTGGAAGCA GGAGAAGCAG 22 20 

CAATTATCGT CCGAGCAGGT ATCTAGGAAA AAAGCTAAGG GAAATAAGAC GGAAACCCGC 2280 

TTCAACCAGC TGGTCGAACA ATATAAGCAG AAATTATTGG GACCTTCTAA AGGAGCACCT 2340 

CTTGCAAAGA GGAGCAAATG GTTTGATAGT TGATGATGGC AGCAGGCTGG GTAAGAAGCT 2400 

GGGTTGTATA CTTTCTGGTG ACACTCCTGG GCTCCTCCCC ATCCCCCGTG TCTCTCACTG 24 60 

AGGGAAAGAA AATCCCCAAG GGCACTGCCA CTGTGCTCGG AGGTGCCCTG GACTGTGTAC 2520 

ATCTGAACTT TGGTCCATCC TTTGATGTGT GGTTCGTTAG CCACAAAGAG AAATATCTGA 2580 

AAGTCAACAT GATGCTTCTT GCATATTATC CAGATTATTG TATGAAGTTG TGTCTATAAT 264 0 

TATTACCAAT TTTTATTCTT TATTTCTCAA ATGGAAACAC CTGAAAAAGC AAAAAAAAAA 2700 

AAAAAAAAAA CTCGAGGGGG GCCCGTACCC AAT 2733 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2277 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CONA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

AUGGCUGGUC UGACCCUGUU CGUUGGUCGU CUGCCGCCGU CCGCUCGUUC CGAACAGCUG 60 
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GAAGAACUGU 


TT/tTTOf*!"* A OOT1 

UCUC ccagg u 






UCCAAAGCUU 


GCCGUGGUUU 


CGGUUALGUU 




GCUCUGAAAG 


AAAUCACCAC 


/■III TV * nnf* 

CUUCGAAGGU 




AAACUGCGUA 


•m TV TV H /"*^TV TV 

ACAAAACCAA 


Af3A A A RR/"?OTT 

AGAAAAAGGu 




GAACCGAAAG 


Mrill TV * TV TV TV rtf* 

CUAAAAAAGC 


rtji TV Ivr'TTTTrT'lT 


5 


CUGUCCUUCA 


AAUGCU C CGA 


AGACGACCUU 




CUGGAAGUUA 


ACAUCC CGCG 


UAAALCGGAt. 




UUCAAAAACC 


UGCUGGAAGC 


UGGUAAAGCU 




GGUCGUACCG 


UUGCUGUUGA 


CUGGGCUGUU 




GUUUCCGCUA 


UCGGUGAAGA 


AAAAUCCCAC 


10 


AAAGGUCGUG 


AAGAAGAAGA 


CAUGGAAGAA 




GACGAAGAAG 


ACGGUGUUUU 


CGACGACGAA 




GUUACCAAAC 


CGGUUCAGAU 


CCAGAAACGU 




UCCGACCACU 


CCGAAGAAGA 


CUCCG AC CUG 




GAACUGGCUC 


AGUCCGACAC 


CUCCACCGAA 


15 


AACAAAAAAA 


AACGUAAACU 


GCCGUCCGAC 




AACCUGUCCU 


UCGACUCCGA 


» O TVTVOTVTVOTVTV 

AGAAGAAGAA 




CUGAAAUACG 


UUCGUAUCGU 


UCUGCACCCG 




GCUCAGUUCA 


UGACCCAGGA 


AGCUGCuCAG 




GAAGCUGGUG 


GUCUGAAACU 


GGACGGUCGU 


20 


GACGAAGCUG 


CUAAACUGCA 


GACCACCAAA 




UACCUGGCUC 


GUGAAGGUCU 


GAUCCGUGCU 




GCUGACAUGG 


CUAAACGUGA 


TV O/'iTTTTTT/'^O TV TV 

ACGUUUCGAA 




AACAUCUUCG 


UUUCCCGUAC 


CCGUCuGUGC 




AAACAGCUGC 


GUAAACUGCU 


GCUGUCCGCU 


25 


GAAUGCCGUG 


UUAUGCGUGA 


CCUGAAAGGU 




GGUUACGCUU 


UCGCUGAAUU 


CCAGGAACAC 




AACAACAACC 


CGGAAAUCUU 


CGGUCCGCUG 




GACCGUCGUA 


AACUGAAAAU 


GAAAGAACUG 




UCCAAACCGG 


CUACCGGUGA 


ACCGCAGAAA 


30 


CAGAAAGCUG 


CUCAGCACCA 


CACCGAAGAA 




AAAGCUGGUU 


CCACCUCCUG 


GACCGGUUUC 




CUGCCGGACG 


GUAAAAAACG 


UCGUAAAGUU 




AUCCGUCUGC 


GUGACAAAGG 


UAAAGUUAAA 




AUCAACCAGU 


GGAAACAGGA 


AAAACAGCAG 


35 


GCUAAAGGUA 


ACAAAACCGA 


AACCCGUUUC 




CUGCUGGGUC 


CGUCCAAAGG 


UGCUCCGCUG 



TV f\ A ft f**l Tf5 CI T 

AAALAUUbL U 


T TPflT Tf T/3T TT T 7A P 


Lunniuinuu U 


1 0(\ 


TV /"•/TTTTOTTO/"' TV 


t xr*f*nns* a Af a 
UGLUGGAAGA 


Hf f If 1 r* 7A n pr*r 7 
L.vjU U vJAuUvj U 




TTT^OTV TV TV TV TTOTV 

UG C AAAAU C A 


ACGUUACUGU 


TT/^/^TlTA TV TV TA TA A 

UGCUAAAAAA 




TV TV TV T\ AfV» A TV TV 

AAAAACGAAA 


ft PTTrrf 7A 2VTTP 


HHCfZ h. 7A & TA 7A fi 


JUU 


r» TAP A. A. A & TA 7Af2 


PTTPrtlTPTTnTATT 




-a /r o 


TV TV TV TV OOOTTTTTT 

AAAALLGUUU 


lTrV3PTTPA./5TTTT 


Pfl (TTTfiPT TfSTTTT 


a *5 n 


OPTTA 7A & 7ATTP.P 


rTTTPfSTTTTT TfW3 
uUuuUUUvyVio 


TTTTTTPflTTITPJAfi 




OTTO* TV TV /^^fTTTV 

CUGAAAGGUA 


?Tf A TV O TV TTO TA TV 

UGAACAUGAA 


TV O TV TV ATTOA 71 A 

AG AAA U L AAA 


C A A 


GCUAAAGACA 


AAUACAAAGA 


CACCCAGUCC 


600 


GAAUCCAAAC 


ACCAGGAAUC 


CGUUAAAAAA 


660 


GAAGAAAACG 


ACGACGACGA 


CGACGACGAC 


720 


GACGAAGAAG 


AAGAAAACAU 


CGAAUCCAAA 


780 


GCUGUUAAAC 


GUCCGGCUCC 


O^^f TTV TV TV TTf^r^ 

GGCUAAAuCC 


840 


GAAGAAU C CG 


ACUCCAUCGA 


i^/^TV^V'rtTTr'TV TV 

CGACGGUGAA 


900 


gaacaggaag 


IV ^1V TV IV ^ <TT T^l T T 

ACAAAGCUGU 


UCAGGUUUCL 


q £ n 

you 


^TTTTH IV IV TV /"l 

GUUAACGAAG 


^tTTTV TV TV TV prVTT 

GUAAAAUvJU U 


U U U v,AU v- L.Va U 




fvf TOOOTT/** TV A /"< 

CUGGGUGAAC 


tT/-t /^t T/^OTV/^OA 

UGCUUCAuCA 


ptti ir*nmirz ta 7a 


1UOU 


GACACCGAAC 


ACUC CAAAGG 


UUGCGCUUUC 


1 u 


AAAUGCCUGC 


UGGCUGCUUC 


/~*f^f*f*r^ TV TV TV TV f 

CCCGGAAAAt. 


1 "JftA 

uu 


CAGCUGAAAG 


UUGACCUGGC 


UGUUACCCGU 


"I ^ £ A 


GUUAAAAAAC 


CGACCG GUAC 


CCGUAACCUG 


T T *) A 

X J 4l U 


GGUAC CAAAG 


/^TT/^(*T1/^ TV TV Of** 

CU G CUG AAGG 


T tr'T TT TT TO r*T T 


1 1 ft n 


CUGCUGAAAC 


TV ^^*TV^>TVTV TV ^*TT 

AC CAGAAACU 


ota a apriPniin 
GAAAG AL. u no 


1 A A A 


CUGCACAACC 


t v^t iv iv tv 

UG CCGAAAGC 


T TOTTT T/^ A /"V5 A O 

UuUUuAvJUAv, 


1 RAA 


ACCUCCGGUG 


AAAAAGGUGU 


UCGUAUCAAA 


1 C <T A 


GUUCACGGUA 


ACAUGAAAGG 


UCAGUCCCUG 


1620 


GAACACGCUC 


UGAAAGCUCU 


GCGUCUGAUC 


1680 


AAACGUCCGA 


UCGUUGAAUU 


CUCCCUGGAA 


174 0 


CGUAUCCAGC 


GUUC C CUG CA 


TV TV TV TVTTr^^r^TT 

GAAAAUGCGU 


1 O A A 
1UUU 


GGUCAGCCGG 


AACCGGCUAA 


AGAC CAGCAG 


i sen 
loo U 


CAGUCCAAAG 


UUCCGCCGGA 


TV ^ IV TV TV TV /^f^TT 

ACAGAAACGU 




CAGACCAAAG 


CUGAAGUUGA 


m « » OT TT TO IV TV 

ACAGGUUGAA 


1 O O A 


CUGGOJCUGC 


CGUCCCACCG 


UGGUCCGAAA 


2040 


CCGGUUCACC 


CGAAAAAACC 


GAAACCGCAG 


2100 


CUGUCCUCCG 


AACAGGUUUC 


CCGUAAAAAA 


2160 


AACCAGCUGG 


UUGAACAGUA 


CAAACAGAAA 


2220 


GCUAAACGUU 


CCAAAUGGUU 


CGACUCC 


2277 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2277 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 

ATGGCCGGCC TGACCCTGTT CGTGGGCCGC CTGCCCCCCA GCGCCCGCAG CGAGCAGCTG 60 

GAGGAGCTGT TCAGCCAGGT GGGCCCCGTG AAGCAGTGCT TCGTGGTGAC CGAGAAGGGC 120 

AGCAAGGCCT GCCGCGGCTT CGGCTACGTG ACCTTCAGCA TGCTGGAGGA CGTGCAGCGC 1B0 

GCCCTGAAGG AGATCACCAC CTTCGAGGGC TGCAAGATCA ACGTGACCGT GGCCAAGAAG 24 0 

15 AAGCTGCGCA ACAAGACCAA GGAGAAGGGC AAGAACGAGA ACAGCGAGTG CCCCAAGAAG 300 

GAGCCCAAGG CCAAGAAGGC CAAGGTGGCC GACAAGAAGG CCCGCCTGAT CATCCGCAAC 360 

CTGAGCTTCA AGTGCAGCGA GGACGACCTG AAGACCGTGT TCGCCCAGTT CGGCGCCGTG 42 0 

CTGGAGGTGA ACATCCCCCG CAAGCCCGAC GGCAAGATGC GCGGCTTCGG CTTCGTGCAG 480 

TTCAAGAACC TGCTGGAGGC CGGCAAGGCC CTGAAGGGCA TGAACATGAA GGAGATCAAG 54 0 

20 GGCCGCACCG TGGCCGTGGA CTGGGCCGTG GCCAAGGACA AGTACAAGGA CACCCAGAGC 600 

GTGAGCGCCA TCGGCGAGGA GAAGAGCCAC GAGAGCAAGC ACCAGGAGAG CGTGAAGAAG 660 

AAGGGCCGCG AGGAGGAGGA CATGGAGGAG GAGGAGAACG ACGACGACGA CGACGACGAC 720 

GACGAGGAGG ACGGCGTGTT CGACGACGAG GACGAGGAGG AGGAGAACAT CGAGAGCAAG 780 

GTGACCAAGC CCGTGCAGAT CCAGAAGCGC GCCGTGAAGC GCCCCGCCCC CGCCAAGAGC 84 0 

25 AGCGACCACA GCGAGGAGGA CAGCGACCTG GAGGAGAGCG ACAGCATCGA CGACGGCGAG 900 

GAGCTGGCCC AGAGCGACAC CAGCACCGAG GAGCAGGAGG ACAAGGCCGT GCAGGTGAGC 960 

AACAAGAAGA AGCGCAAGCT GCCCAGCGAC GTOAACGAGG GCAAGACCGT GTTCATCCGC 1020 

AACCTGAGCT TCGACAGCGA GGAGGAGGAG CTGGGCGAGC TGCTGCAGCA GTTCGGCGAG 1080 

CTGAAGTACG TGCGCATCGT GCTGCACCCC GACACCGAGC ACAGCAAGGG CTGCGCCTTC 1140 

30 GCCCAGTTCA TGACCCAGGA GGCCGCCCAG AAGTGCCTGC TGGCCGCCAG CCCCGAGAAC 1200 

GAGGCCGGCG GCCTGAAGCT GGACGGCCGC CAGCTGAAGG TGGACCTGGC CGTGACCCGC 1260 

GACGAGGCCG CCAAGCTGCA GACCACCAAG GTGAAGAAGC CCACCGGCAC CCGCAACCTG 1320 

TACCTGGCCC GCGAGGGCCT GATCCGCGCC GGCACCAAGG CCGCCGAGGG CGTGAGCGCC 1380 

GCCGACATGG CCAAGCGCGA GCGCTTCGAG CTGCTGAAGC ACCAGAAGCT GAAGGACCAG 144 0 

35 AACATCTTCG TGAGCCGCAC CCGCCTGTGC CTGCACAACC TGCCCAAGGC CGTGGACGAC 1500 

AAGCAGCTGC GCAAGCTGCT GCTGAGCGCC ACCAGCGGCG AGAAGGGCGT GCGCATCAAG 1560 

GAGTGCCGCG TGATGCGCGA CCTGAAGGGC GTGCACGGCA ACATGAAGGG CCAGAGCCTG 162 0 
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GGCTVACGCCT 


TCGCCGAGTT 


CCAGGAGCAC 


GAGCACGCCC 


TGAAGGCCCT 


GCGCCTGATC 


1680 


a ap a a haac c 


CCGAGATCTT 


CGGCCCCCTG 


AAGCGCCCCA 


TCGTGGAGTT 


CAGCCTGGAG 


1740 


A fVGPCG PA 


AGCTGAAGAT 


GAAGGAGCTG 


CGCATCCAGC 


GCAGCCTGCA 


GAAGATGCGC 


1800 


AGPAAGCCCG 


CCACCGGCGA 


GCCCCAGAAG 


GGCCAGCCCG 


AGCCCGCCAA 


GGACCAGCAG 


1860 


fAGAAGGCCG 


CCCAGCACCA 


CACCGAGGAG 


CAGAGCAAGG 


TGCCCCCCGA 


GCAGAAGCGC 


1920 




GCACCAGCTG 


GACCGGCTTC 


CAGACCAAGG 


CCGAGGTGGA 


GCAGGTGGAG 


1980 


CTGCCCGACG 


GCAAGAAG CG 








CGGGPPPAAG 


2040 


ATCCGCCTGC 


GCGACAAGGG 


CAAGGTGAAG 


CCCGTGCACC 


CCAAGAAGCC 


CAAGCCCCAG 


2100 


ATCAACCAGT 


GGAAGCAGGA 


GAAGCAGCAG 


CTGAGCAGCG 


AGCAGGTGAG 


CCGCAAGAAG 


2160 


GCCAAGGGCA 


ACAAGACCGA 


GACCCGCTTC 


AACCAGCTGG 


TGGAGCAGTA 


CAAGCAGAAG 


2220 


CTGCTGGGCC 


CCAGCAAGGG 


CGGCCCCCTG 


GCCAAGCGCA 


GCAAGTGGTT 


CGACAGC 


2277 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



GGGTTGCGGA 


GGGTGGGCCT 


GGGAGGGGTG 


GTGGCCATTT 


TTTGTCTAAC 


CCTAACTGAG 


60 


AAGGGCGTAG 


GCGCCGTGCT 


TTTGCTCCCC 


GCGCGCTGTT 


TTTCTCGCTG 


ACTTTCAGCG 


120 


GGCGGAAAAG 


CCTCGGCCTG 


CCGCCTTCCA 


CCGTTCATTC 


TAGAGCAAAC 


AAAAAATGTC 


180 


AGCTGCTGGC 


CCGTTCGCCC 


CTCCCGGGGA 


CCTGCGGCGG 


GTCGCCTGCC 


CAGCCCCCGA 


240 


ACCCCGCCTG 


GAGGCCGCGG 


TCGGCCCGGG 


GCTTCTCCGG 


AGGCACCCAC 


TGCCACCGCG 


300 


AAGAGTTGGG 


CTCTGTCAGC 


CGCGGGTCTC 


TCGGGGGCGA 


GGGCGAGGTT 


CAGGCCTTTC 


360 


AGGCCGCAGG 


AAGAGGAACG 


GAGCGAGTCC 


CCGCGCGCGG 


CGCGATTCCC 


TGAGCTGTGG 


420 


GACGTGCACC 


CAGGACTCGG 


CTCACACATG 


CAGTTCGCTT 


TCCTGTTGGT 


GGGGGGAACG 


480 


CCGATCGTGC 


GCATCCGTCA 


CCCCTCGCCG 


GCAGTGGGGG 


CTTGTGAACC 


CCCAAACCTG 


540 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 



25 



AVO 98/01543 



PCT/US97/12297 



(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGGTTGCGGA GGGTGGGCCT GGOAGGGGTG GTGGCCATTT TTTGTCCAAC CCCAACTGAG 60 

AAGGGCGTAG GCGCCGTGCT TTTGCTCCCC GCGCGCTGTT TTTCTCGCTG ACTTTCAGCG 120 

GGCGGAAAAG CCTCGGCCTG CCGCCTTCCA CCGTTCATTC TAGAGCAAAC AAAAAATGTC 180 

AGCTGCTGGC CCGTTCGCCC CTCCCGGGGA CCTGCGGCGG GTCGCCTGCC CAGCCCCCGA 240 

ACCCCGCCTG GAGGCCGCGG TCGGCCCGGG GCTTCTCCGG AGGCACCCAC TGCCACCGCG 3 00 

AAGAGTTGGG CTCTGTCAGC CGCGGGTCTC TCGGGGGCGA GGGCGAGGTT CAGGCCTTTC 360 

AGGCCGCAGG AAGAGGAACG GAGCGAGTCC CCGCGCGCGG CGCGATTCCC TGAGCTGTGG 420 

GACGTGCACC CAGGACTCGG CTCACACATG CAGTTCGCTT TCCTGTTGGT GGGGGGAACG 4 80 

CCGATCGTGC GCATCCGTCA CCCCTCGCCG GCAGTGGGGG CTTGTGAACC CCCAAACCTG 540 

(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



GGGTTGCGGA 


GGGTGGGCCT 


GGGAGGGGTG 


GTGGCCATTT 


TTTGTCTAAG 


CCTAAGTGAG 


60 


AAGGGCGTAG 


GCGCCGTGCT 


TTTGCTCCCC 


GCGCGCTGTT TTTCTCGCTG ACTTTCAGCG 


120 


GGCGGAAAAG 


CCTCGGCCTG 


CCGCCTTCCA 


CCGTTCATTC 


TAGAGCAAAC 


AAAAAATGTC 


180 


AGCTGCTGGC 


CCGTTCGCCC 


CTCCCGGGGA 


CCTGCGGCGG 


GTCGCCTGCC 


CAGCCCCCGA 


240 


ACCCCGCCTG 


GAGGCCGCGG 


TCGGCCCGGG 


GCTTCTCCGG 


AGGCACCCAC 


TGCCACCGCG 


300 


AAGAGTTGGG 


CTCTGTCAGC 


CGCGGGTCTC 


TCGGGGGCGA 


GGGCGAGGTT 


CAGGCCTTTC 


360 


AGGCCGCAGG 


AAGAGGAACG 


GAGCGAGTCC 


CCGCGCGCGG 


CGCGATTCCC 


TGAGCTGTGG 


420 


GACGTGCACC 


CAGGACTCGG 


CTCACACATG 


CAGTTCGCTT 


TCCTGTTGGT 


GGGGGGAACG 


480 


CCGATCGTGC 


GCATCCGTCA 


CCCCTCGCCG 


GCAGTGGGGG 


CTTGTGAACC 


CCCAAACCTG 


540 



(2) INFORMATION FOR SEQ ID NO: 9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 bas pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGTTGCGGA GGGTGGGCCT GGGAGGGGTG GTGGCCATTT TTTGTCTACC CTACTGAGAA 60 

10 GGGCGTAGGC GCCGTGCTTT TGCTCCCCGC GCGCTGTTTT TCTCGCTGAC TTTCAGCGGG 120 

CGGAAAAGCC TCGGCCTGCC GCCTTCCACC GTTCATTCTA GAGCAAACAA AAAATGTCAG 180 
CTGCTGGCCC GTTCGCCCCT CCCGGGGACC TGCGGCGGGT CGCCTGCCCA GCCCCCGAAC 240 
CCCGCCTGGA GGCCGCGGTC GGCCCGGGGC TTCTCCGGAG GCACCCACTG CCACCGCGAA 300 
GAGTTGGGCT CTGTCAGCCG CGGGTCTCTC GGGGGCGAGG GCGAGGTTCA GGCCTTTCAG 360 

15 GCCGCAGGAA GAGGAACGGA GCGAGTCCCC GCGCGCGGCG CGATTCC CTG AGCTGTGGGA 420 

CGTGCACCCA GGACTCGGCT CACACATGCA GTTCGCTTTC CTGTTGGTGG GGGGAACGCC 480 
GATCGTGCGC ATCCGTCACC CCTCGCCGGC AGTGGGGGCT TGTGAACCCC CAAACCTG 538 



20 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
25 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 5 . . 13 

30 (D) OTHER INFORMATION: /note= "Xaa represents isoleucine 

or leucine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
Glu Ala Ala Thr Xaa Asp Xaa Pro Gin Gin Gly Ala Asn Lys 
35 i 5 io 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid comprising SEQ ID NO:3, or a portion thereof encoding a 
telomerase protein pl05 (SEQ ID NO:l) domain having human telomerase-specific activity. 

2. An isolated nucleic acid according to claim 1 , wherein said domain specifically binds 
5 at least one of the telomerase RNA of SEQ ID NO: 6, a telomerase subunit, substrate, 

agonist, antagonist, chaperone, regulatory protein or cytoskeletal protein. 

3. An isolated nucleic acid comprising a portion of SEQ ID NO: 3, bases 1-2345, which 
specifically hybridizes with, or amplifies from a nucleic acid having the sequence defined by 

10 SEQIDNO:3. 

4. A method of modulating the expression of a telomerase transcript, said method 
comprising steps: contacting inside a cell an endogenous transcript encoding a telomerase 
protein with a nucleic acid according to claim 3 under conditions whereby said nucleic acid 

15 hybridizes with said transcript, whereby the expression of said transcript is modulated. 

5. A recombinant nucleic acid consisting of an open reading frame comprising SEQ ID 
NO:3, or a portion thereof sufficient to encode a telomerase protein pi 05 (SEQ ID NO: 1) 
domain having human telomerase-specific activity. 

20 

6. A recombinant nucleic acid according to claim 5, wherein said open reading frame 
comprises SEQ ID NO:3, bases 97-2370. 

7. A cell comprising a nucleic acid according to claim 5. 

25 

8. A method of making an isolated telomerase protein, comprising steps: introducing a 
nucleic acid according to claim 5 into a host cell or cellular extract, incubating said host cell 
or extract under conditions whereby said nucleic acid is expressed as a transcript and said 
transcript is expressed as a translation product comprising said protein, and isolating said 

30 translation product. 
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9. An isolated human telomerase made by the method of claim 8. 

10. A method of screening for an agent which modulates the binding of a human 
telomerase to a binding target, said method comprising the steps of: 

translating the nucleotide sequence of SEQ ID NO:3 of a nucleic acid according to 
claim 5 to obtain a human telomerase protein domain; 
incubating a mixture comprising: 

a telomerase or telomerase protein comprising said domain, 

a binding target of said telomerase protein, and 

a candidate agent; 

under conditions whereby, but for the presence of said agent, said telomerase or 
telomerase protein specifically binds said binding target at a reference affinity; 

detecting the binding affinity of said telomerase or telomerase protein to said binding 
target to determine an agent-biased affinity, 

wherein a difference between the agent-biased affinity and the reference affinity 
indicates that said agent modulates the binding of said telomerase or telomerase protein to 
said binding target. 

11. A method according to claim 10, wherein said binding target is a substrate of said 
telomerase and said reference and agent-biased binding affinity are each detected as the 
polymerization by said telomerase of a nucleic acid on said substrate. 
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